Continuous Online Learning and New Insights to Online Imitation Learning

Lee, Jonathan, Cheng, Ching-An, Goldberg, Ken, Boots, Byron

arXiv.org Machine Learning

Online learning is a powerful tool for analyzing iterative algorithms. However, the classic adversarial setup sometimes fails to capture the regularity present in practical online problems. Motivated by this, we establish a new setup, called Continuous Online Learning (COL), where the gradient of the online loss function changes continuously across rounds with respect to the learner's decisions. We show that COL covers, and more appropriately describes, many interesting applications, from general equilibrium problems (EPs) to optimization in episodic MDPs. Using this new setup, we revisit the difficulty of achieving sublinear dynamic regret. We prove that there is a fundamental equivalence between achieving sublinear dynamic regret in COL and solving certain EPs, and we present a reduction from dynamic regret to both the static regret and the convergence rate of the associated EP. Finally, we specialize these new insights to online imitation learning and show an improved understanding of its learning stability.
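
To make the setup concrete, here is a minimal sketch, in LaTeX, of the COL property and the dynamic regret it targets. The notation ($f$, $\ell_t$, $\mathcal{X}$) is our own choice for illustration; the paper's exact symbols may differ.

% COL sketch (notation ours): the round-t loss is generated by a fixed bimap f,
% queried at the learner's own decision x_t, with a gradient that is continuous
% in that query argument.
\[
  \ell_t(x) = f(x_t, x), \qquad \nabla_x f(x', x) \ \text{continuous in } x'.
\]
% Dynamic regret compares against the per-round minimizers, not a fixed comparator:
\[
  \mathrm{Regret}^{d}_{T} = \sum_{t=1}^{T} \ell_t(x_t) - \sum_{t=1}^{T} \min_{x \in \mathcal{X}} \ell_t(x).
\]

Sublinear dynamic regret then means $\mathrm{Regret}^{d}_{T} = o(T)$, which, per the abstract, is equivalent to solving the associated EP.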


Online Learning with Continuous Variations: Dynamic Regret and Reductions

Cheng, Ching-An, Lee, Jonathan, Goldberg, Ken, Boots, Byron

arXiv.org Machine Learning

We study the dynamic regret of a new class of online learning problems, in which the gradient of the loss function changes continuously across rounds with respect to the learner's decisions. This setup is motivated by the use of online learning as a tool to analyze the performance of iterative algorithms. Our goal is to identify interpretable dynamic regret rates that explicitly account for loss variations arising from the learner's own decisions, as opposed to external constraints. We show that achieving sublinear dynamic regret in general is equivalent to solving certain variational inequalities, equilibrium problems, and fixed-point problems. Leveraging this identification, we present necessary and sufficient conditions for the existence of efficient algorithms that achieve sublinear dynamic regret. Furthermore, we show a reduction from dynamic regret to both static regret and the convergence rate to equilibria in the aforementioned problems, which allows us to analyze the dynamic regret of many existing learning algorithms in a few steps.
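
As a rough illustration of this setting, below is a minimal, self-contained Python sketch. It is entirely our own construction: the quadratic bimap, the matrices A and B, and all parameters are illustrative assumptions, not from the paper. It runs online gradient descent on a COL-style problem and tracks dynamic regret against the per-round minimizers.

import numpy as np

# Hypothetical COL instance (our assumption): l_t(x) = 0.5 x^T A x + (B x_t)^T x,
# so the loss gradient A x + B x_t varies continuously with the learner's
# previous decision x_t -- the defining COL property.
rng = np.random.default_rng(0)
d, T, eta = 5, 500, 0.1
A = np.eye(d)                           # curvature of each round's loss
B = 0.1 * rng.standard_normal((d, d))   # small coupling to the learner's decision

x = np.ones(d)                          # start away from the equilibrium at 0
dynamic_regret = 0.0
for t in range(T):
    loss = lambda z: 0.5 * z @ A @ z + (B @ x) @ z  # round-t loss, fixed by current x
    x_star = np.linalg.solve(A, -B @ x)             # per-round minimizer of l_t
    dynamic_regret += loss(x) - loss(x_star)        # moving-comparator regret
    x = x - eta * (A @ x + B @ x)                   # online gradient descent step

print(f"dynamic regret after {T} rounds: {dynamic_regret:.4f}")

Because the coupling B is small here, the iterates converge to the equilibrium and the accumulated dynamic regret stays bounded, hence sublinear in T; with a strong coupling the same loop can accumulate linear dynamic regret, consistent with the abstracts' message that solvability of the associated equilibrium problem governs what is achievable.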